NPS55-85-031 


NAVAL  POSTGRADUATE  SCHOOL 

Monterey,  California 


TECHNICAL 


TRANSFORMATION   OF   DISPLACEMENT 
AND  ORIENTATION   CORRECTION   DATA 

GENE   GYGAX 
ROBERT   R.    READ 

DECEMBER   1985 


FedDocs 

D   208.14/2 

NPS-55-85-031 


Approved   for  public   release;   distribution   unlimited 

Prepared   for: 

Naval    Undersea  Warfare  Engineering  Station 
^ynort,   WA     98345 


NAVAL  POSTGRADUATE  SCHOOL 
MONTEREY,  CALIFORNIA 


Rear  Admiral  R.  H.  Shuraaker  D.  A.  Schrady 

Superintendent  Provost 


Reproduction  of  all  or  part  of  this  report  is  authorized. 


This  report  was  prepared  for  Naval  Undersea  Warfare  Engineering 
Station  by: 


Unclassified 


eCufliry  classification  of  this  page 


REPORT  DOCUMENTATION  PAGE 


a.  Rfc'ORT  SECURITY  CLASSIFICATION 

Unclassi  fied 


lb    RESTRICTIVE    MARKINGS 


i.  SECURITY  CLASSIFICATION  AUTHORITY 


0.  DECLASSIFICATION /DOWNGRADING   SCHEDULE 


3     DISTRIBUTION/ AVAILABILITY  OF   REPORT 

Approved   for  public   release;   distribution 
is   unlimited 


PERFORMING  ORGANIZATION  REPORT  NUMBER(S) 

NPS55-85-031 


S    MONITORING  ORGANIZATION   REPORT   NUMBERS) 


a.  NAME  OF  PERFORMING  ORGANIZATION 

Naval    Postgraduate  School 


6b    OFFICE  SYMBOL 
(If  applicable) 


7a.  NAME   OF   MONITORING  ORGANIZATION 

Naval  Undersea  Warfare  Engineering  Station 


c.  ADDRESS  (Gfy,  State,  and  ZIP  Code) 

Monterey,   California     93943-5100 


7b    ADDRESS  (Ory,  State,  and  ZIP  Code) 


Keyport,   Washington     98345 


a.  NAME  OF  FUNDING /SPONSORING 
ORGANIZATION 

aval  Undersea  V7arf  are  Engineeriig 


c.^'dtittiii  (City,  State,  and  ZIPCode) 

Keyport,  Wa  98345 


8o    OFFICE  SYMBOL 
(If  applicable) 


9    PROCUREMENT  INSTRUMENT  IDENTIFICATION   NUMBER 

MIF  O/H 


10    SOURCE  OF   FUNDING   NUMBERS 


PROGRAM 
ELEMENT  NO 


OROjECT 
NO 


TAS< 

NO. 


A'OR-     JNIT 
ACCESSION   NO 

N00253 

— 


""*n  n  n  "i 


TITLE  (inciuae  Secunty  Classification) 

TRANSFORMATION  OF  DISPLACEMENT  AND  ORIENTATION  CORRECTION  DATA 


2  PERSONAL  AUTHOR(S) 

Gyqax,  Gene  and  Read,  Robert  R. 


3a.  TYPE  OF  REPORT 

Technical 


13b.  TIME  COVERED 
FROM  TO 


14    DATE  OF  REPORT    (Year.  Month.  Day) 

85,    December  


15    PAGE   COUN: 

32 


6    SUPPLEMENTARY   NOTATION 


COSATl  CODES 


FIELD 


GROUP 


SUB-GROUP 


18    SUBJECT  TERMS  [Continue  on  reverse  it  necessary  and  lOentity  oy  diock  numoer) 

Bivariate   Data,   Transformations,   Principal    Components 


3   ABSTRACT  (Continue  on  reverse  if  necessary  and  identify  by  block  numoer) 

The  simulation  of  an   underwater  vehicle   track   has   been   used  to  develop   reference 
distributions    for  displacement  and  orientation   corrections   to   the   test   range  sensor 
arrays.     These   distributions   show  unusual    behavior.     Of  particular  concern   are   the 
scatter  plots   of  the   magnitude  of  displacement  and  orientation   rotation   angle,   which 
exhibit   a   "hairpin"   shape.      The   mechanisms    producing  this   shape   are    interpreted  and 
data  transformations    are   developed  that   produce  scatter  plots   of  more   regular   form. 
In  addition,   some   plotting  methods   are   applied  to   provide   some   visual    information 
about   the  joint  distribution   of  the   transformed   data. 


0    DISTRIBUTION/  AVAILABILITY  OF  ABSTRACT 
E  UNCLASSIFIED/UNLIMITED       D   SAME  AS   RPT  Q  DT'C   USERS 


21     ABSTRACT  SECURITY   CLASSIFICATION 


2a    NAME  OF   RESPONSIBLE  IND1V1DUAL 

Robert   R.    Read 


220  TELEPHONE  (incluoe  Area^oae) 

(408)646-2382 


22c    OFFICE   SYMBOL 

Code   55Re 


O    r  /-»  r\  i 


Bj  APR  poition  ~iav  ne  used  until  exnauued 
"  ;  ■y^y  ed  '  0~S  jro  oo'.oiote 


c  c  r  1 1  p  |  t  v   C  L  A  S  S ! ' '  C  A  T 


I.   Introduction 

Imagine  the  scatter  plot  of  data  from  two  strongly  correlated  variables 
with  the  provision  that  only  the  magnitudes  of  the  individual  variables  are 
recorded.  Such  data  can  produce  "hairpin"  shaped  scatter  plots  (see  Figure 
1 )  and  it  would  be  useful  to  develop  methodology  for  straightening  the 
hairpin,  i.e.  for  assigning  negative  signs  to  some  of  the  values.  Then 
perhaps  one  or  more  of  the  following  could  be  achieved  with  the  transformed 
data: 

i.   The  scatter  plots  could  be  fitted  with  bivariate  density 
functions,  whose  contours  are  convex  sets, 
ii.   The  scatter  plots  could  be  fitted  with  bivariate  normal 
density  functions. 

In  general  this  problem  is  a  difficult  one.   Some,  but  not  all,  of  the 
points  that  fall  near  a  coordinate  axis  need  to  be  reflected  across  that 
axis.   A  rule  for  making  such  selections  should  have  some  depth.   The 
present  work  does  not  treat  the  general  problem.   We  are  more  interested  in 
expediency  so  that  the  general  nature  of  the  straightened  scatter  plot  may 
be  better  understood.   Also,  our  data  structure  contains  concomitant 
information  that  will  be  exploited. 

The  mother  project,  of  which  this  is  a  part,  deals  with  the  maintenance 
of  calibrations  of  the  3-D  underwater  test  ranges  operated  by  NUWES.   The 
tracking  of  vehicles  is  performed  by  a  grid  work  of  three  dimensional  sonar 
arrays  located  underwater  at  the  bottom.   A  number  of  these  individual 


arrays  contribute  tracking  information  which  is  integrated  to  reconstruct 
the  path  of  the  targeted  vehicle.  Mismatches  occur  when  the  splicing  of 
individual  contributions  is  done  and  the  possibility  that  they  are  caused  by 
sensor  movement  must  be  faced. 

Fuller  details  can  be  found  in  References  1 ,  2,  3.   Of  immediate 
concern  is  the  product  of  the  program  KEYMAIN  which  takes  data  from  the 
array  overlap  region  and  estimates  the  relative  displacement  and  orientation 
corrections  that  minimize  the  track  splicing  mismatch.   To  judge  the 
significance  of  these  estimates  one  needs  a  reference  distribution  which 
describes  probabilistically  the  amount  of  variability  that  must  be  tolerated 
in  a  given  situation.   A  simulation  model,  whose  purpose  is  to  generate  such 
reference  distributions,  has  been  developed  and  is  described  in  Reference  1. 
It  simulates  the  path  of  the  vehicle,  complete  with  inherent  or  random 
variability,  for  each  of  the  two  sensor  arrays  in  an  overlap  region.   The 
results  are  passed  to  the  program  KEYMAIN,  which  estimates  the  corrections 
to  one  array  relative  to  the  other  one. 

The  output  of  KEYMAIN  contains  a  pair  of  three  dimensional  vectors. 
The  first  is  the  set  of  three  displacement  corrections  (downrange, 
crossrange,  and  vertical)  for  an  array's  location.   The  second  is  the  set  of 
three  Euler  angles  that  describe  the  orientation  correction  for  the  array. 
Use  is  made  of  two  other  ways  to  describe  these  latter  corrections: 
(i)   The  orthonormal  matrix  B  which  provides  the  information'  in  the  form  of 
a  linear  transformation.   (ii)   The  axis  direction  numbers  and  the  angle  of 
rotation. 


It  is  hoped  that  these  six  pieces  of  information  can  be  replaced  with 
two,  namely  the  magnitude  of  the  displacement  and  the  angle  of  rotation. 
The  joint  distribution  of  these  two  can  be  economically  plotted  on  a  piece 
of  paper,  and  it  seems  reasonable  that  this  should  be  adequate  to  support 
the  decision  processes  mentioned  above.   Such  are  the  data  in  Figures  1  and 
2. 

Some  additional  unusual  behavior  is  exhibited  in  Figure  2;  namely, 
there  seems  to  be  a  "base  line  of  support"  which  the  bivariate  data  do  not 
cross.   The  mechanism  that  produces  this  is  not  understood  at  this  writing, 
and  this  feature  will  not  be  treated. 

The  organization  of  the  report  is  as  follows:   Section  II  contains  the 
mathematical  development  of  methodology  to  "  straighten  the  hairpins"  and 
perform  ad  hoc  graphic  tests  for  the  bivariate  normal  distribution.   Section 
III  contains  the  application  of  this  methodology  to  our  six  cases  of 
simulated  data.   A  summary  follows. 

II .   Mathematical  Support 

The  simulated  data  are  six  dimensional  and  are  best  viewed  as  two 
strongly  correlated  3-D  sets;  the  three  components  of  displacement  and  the 
three  components  of  orientation.   As  a  data  reduction  technique  let  us  draw 
down  to  two  dimensions;  the  magnitude  of  displacement  d,  and-  the  magnitude 
of  the  angle,  0,  of  reorientation  about  an  axis  of  rotation.   It  is  the 
scatter  of  0  on  d  that  exhibits  the  unusual  behavior  (e.g.  Figure  1). 


The  hairpin  shape  is  readily  interpreted  once  one  is  aware  that 
displacement  and  orientation  are  strongly  correlated,  and  that  the  3~D  data 
swarm  of  displacement  is  very  long  and  narrow.   That  is,  the  variability  of 
displacement  is  dominated  by  its  first  principal  component.   At  one  end  of 
this  principal  component  both  0  and  d  are  large.   They  both  become  smaller 
as  one  moves  away  from  the  extreme  and  toward  the  center  of  the  principal 
component.   But  one  of  these,  say  0,  reaches  its  minimum  first  and  starts 
becoming  larger  again  before  the  other,  d,  achieves  its  minimum  value.  This 
may  cause  an  observable  "arch  on  the  pin".   Then  the  data  fills  out  the 
other  leg  of  the  pin  as  we  move  to  the  other  extreme  of  the  principal 
component.   Figures  3  and  4  bear  this  out.   These  are  scatter  plots  of  d, 
resp.  0,  on  the  first  principal  component,  u.   The  key  to  straightening  the 
hairpins  may  be  found  in  assigning  negative  values  to  the  left  leg  of  these 
"v"  shaped  scatter  plots. 

This  section  contains  the  mathematical  support  needed  to  straighten  the 
hairpin  and  perform  an  ad  hoc  test  for  bivariate  normal  distribution  to  the 
result.   It  contains  three  subsections:   (i)   Development  of  formulas 
relating  the  three  ways  to  represent  the  orientation  corrections.   (ii) 
Discussion  of  methods  to  assign  negative  signs  to  some  of  the  d  and  0. 
(iii)   Development  of  an  ad  hoc  test  for  bivariate  normal  distribution. 
They  are  presented  in  order. 


Subsection  (i).   Three  Ways  to  Represent  the  Orientation  Correction 

The  three  ways  are: 

1)  Orthonormal  matrix  (B). 

2)  Euler  angles  (<J>i,  <J>2,  4>3). 

3)  Axis  (x)  and  rotation  (0). 

Each  has  its  advantages.   We  proceed  to  describe  them  and  show  how  to 
transform  from  one  to  the  other. 

T 
1.   An  orthonormal  matrix  B  is  defined  by  the  property  B  B  =  I  (the  identity 

matrix)  and  may  be  viewed  as  a  linear  transformation  that  preserves 

distance.   Thus,  if  x  and  y  are  two  vectors,  the  distance  separating  them 

can  be  computed  from 


T 
|x  -  y||2  =  (x  -  y)  (x  -  y)  -  Z(x     -   y  ) 


To  illustrate  the  distance  preserving  property,  let  x'  =  Bx  and  y'  =  By 
represent  the  images  of  x  and  y  under  the  linear  transformation  B.   Then 


x'  -  y'||2  =  | | Bx  -  By||2  =  ||B(x  -  y)||2  =  (B  (x  -  y)T(B(x  -  y)) 


=  (x  -  y)T  BTB(x  -  y)  =  (x  -  y)T(x  -  y)  =  ||x  -  y|| 


Thus  the  distance  between  x  and  y  and  the  distance  between  their  images  is 

the  same.   These  tranformations  are  the  rigid  rotations.   Because  of  the 

T 
constraint,  B  B  =  I,  the  matrix  B  has  only  three  degrees  of  freedom.   Three 


are  lost  because  each  column  must  have  length  unity,  and  three  more  are  lost 
because  each  pair  of  columns  must  be  orthogonal. 

2.   The  three  Euler  angles  specify  the  rotations  that  take  place  in  the  base 
planes  of  the  coordinate  system  as  each  axis  is  held  fixed.   Since  the 
positions  of  these  planes  change  with  each  rotation,  it  is  necessary  to 
specify  the  successive  order  in  which  the  axes  are  held  fixed.   We  choose  to 
first  hold  the  x-axis  (downrange)  fixed  and  rotate  the  y  -  z  plane  through 
all  angles  of  <J>X;  then  hold  the  y-axis  (crossrange)  fixed  in  its  new 
position  so  as  to  rotate  the  (new)  x  -  z  plane  through  an  angle  <J>25  followed 
by  holding  the  z-axis  (vertical)  fixed  in  its  (twice)  tranformed  position 
and  rotate  the  x  -  y  plane  through  an  angle  cf>3. 

In  navigation  work,  these  three  angles  are  called,  roll,  pitch,  and 
yaw.   In  the  NUWES  software  they  are  designated  as  Y-TILT,  X-TILT,  and 
Z-ROT. 

The  Euler  angles  are  easily  related  to  the  B-Matrix.   Each  is  a 
rotation  in  a  plane  of  two  coordinate  axes  while  the  third  is  held  fixed. 
It  follows  that  each  rotation  has  its  own  representation  as  an  orthonormal 
matrix.   Thus,  letting 


S.  =  sin(4>.)         and        C.  =  cos(d>.) 
11  11 


for   1-1,    2,    3 ,   we  can  specify: 


\  1       0        o  ]  \c2    o     s2l  (c3     -s3 

B1=  <  0        Cl-S1i  B2=         JO         1         0     >  B3=  j  S  3 

(o        Si        d\  [S2     0        C2\  (0  1 


(Note  that  we  have  specified   clockwise,    or   "back"   rotation.      Then,    using  the 
stated  order   of   application, 

B   =   B  3B  2B  j 

and  the  algebraic  result  of  these  multiplications  is 

\  ^  2   3  \  ^  l  ^  2   3      ^  1  ^  3  /      \  ^  1  ^  2   3   ™"     13' 

B  =    <   C2S3     -(S^jSg  -  CjC,)   -(C1S2S3  +  S^a) 

V.Oo  12  12 

Given  B,  the  Euler  angles  can  be  recovered  in  the  following  way, 
Compute  principle  value  solution  using 

sin(4>2)   =  b31      sinCcf^)   =  b  3  2/cos(<j>2)      sin(4>3)   =  b2 /cost^) 


where 


-it/2    <   d>.    <    tt/2  for  i    =    1  ,2,3. 

l 


Then  check  for  possible  modification  of  <J>  t  and  <j>3  using  the  signs  of  bxl   and 
b33.   That  is,  make  replacements. 


j      «■  tt   -  <J>,    if   b3  3    <  0 


3      *  it   -  <j>3    if  bj  x    <  0 


3.   The  axis  and  rotation  representation  gives  recognition  to  the  fact  that 
a  rigid  turn  in  three  space  must  involve  the  turning  of  the  entire  space 
through  an  angle  of  0  about  an  axis  x.   The  only  points  unmoved  by  this 
transformation  are  the  points  on  the  axis.   Thus 


ix  =  x 


and  the  axis  x  is  identified  as  the  eigenvector  of  B  having  eigenvalue  one. 
In  order  to  make  it  unique,  let  us  agree  that  x  is  normalized  so  that 

|x||2  =  1  and  it  lies  in  the  half  space  defined  by  a  non-negative  first 
component,  Xj  £  0.   Note  there  are  two  degrees  of  freedom  in  this  x. 

To  determine  the  rotational  angle  0,  let  us  set  up  a  righthanded 
coordinate  system  with  x  as  its  first  axis,  and  then  determine  the  angle  of 
rotation  resulting  from  applying  B  to  any  vector  orthogonal  to  x.   Our  goal 
is  to  construct  a  change  of  basis  matrix  C  which  has  x  as  its  first  row.   It 

remains  to  specify  the  second  and  third  rows,  y  and  z.   Since  x  is  pointed 

T 
in  the  half  space  X[  >  0,  let  us  apply  the  Gram-Schmidt  process  to  e2  = 

(0,1,0)  in  order  to  specify  y. 


I.e. 


y  -  e2  -  (x, ,e2)x 


followed  by  a  normalization  to  give  it  length  one.   Thus 


y  =  (-XjX2,  1  -  x22,  -x2x3)/  /I -  x22 


In  the  construction  of  z  we  want  to  be  assured  that  the  result  is  a 
right  handed  system,  so  we  use  the  vector  cross  product. 


z  =  x  x  y  =  det 


e !   e2   e; 
x  j    x2    x • 

yi   y2   y; 


where  the  representation  by  the  determinant  is  a  schematic  one;  inferring  an 
expansion  by  the  first  row,  whose  components  are  the  unit  coordinate 
vectors. 

The  result  is 


=  (-x3,  0,  x,)//! -  x 


Let  us  apply  our  change  of  basis  matrix  C 


C  =  (x,y,z) 


to  the  vector  By.   From  geometrical  considerations,  we  have  By  rotates  y 
through  the  angle  0  and  its  representation  in  the  basis  C  must  be 


CBy  =  ^cos(O) 
/sin(O) 


The  third  component  can  be  inverted  to  compute  the  angle  9.  A  check  of  the 
sign  of  the  second  component  will  tell  us  the  quadrant  to  which  9  belongs. 
Thus  x  and  9  can  be  constructed  from  the  orthonormal  matrix  B. 

To  reverse  the  process,  i.e.  to  construct  B  from  the  axis  x  and  angle  ( 
we  can  proceed  as  follows: 

First  construct  the  orthonormal  matrix  C  from  x  exactly  as  before. 
Then  apply  the  matrix 


\*     = 


1      0      0 
0   cos(9)   sin(0) 
0   -sin(9)   cos(9) 


to  C.   This  will  rotate  the  plane  of  the  second  and  third  coordinates 
through  the  angle  9  with  the  first  coordinate  held  fixed.   One  can  verify 
easily  that 

B*CBx  =  ex      B*  CBy  =  e2      B*  CBz  =  e3, 


10 


i  .e. 


B*CBCT  =  I 


and  from  this  equation  we  may  solve  for  B. 

T   T 

B  =  CB'C 


Subsection  (ii).   Assignment  of  Signs. 

One  first  needs  the  values  of  u,  the  first  principal  component  of  the 
3-D  displacement  data  swarm.   We  simply  state  that  if  x  is  a  set  of  3-D 
displacement  vectors,  and  p  is  the  eigenvector  corresponding  to  the  largest 
eigenvalue  of  the  covariance  matrice  of  {x},  then  the  values  of  u  are  the 
dot  products 

u  =  p  • (x  -  x) 

where  x  is  the  centroid  (average)  of  the  data  swarm. 

When  one  regresses  either  d  or  0  on  u,  the  resulting  scatter  plot  has 
the  appearance  of  a  "v",  or  an  absolute  value  function.   A  difficulty  is 
encountered  in  that  the  minimum  of  d  or  0  is  not  necessarily  (sufficiently 
close  to)  zero.   This  creates  the  need  to  estimate  values  d  and  0  so  that 
the  replacements 


d^d-dQ       o  <-  0  -  0Q 


1 1 


will  produce  a  continuous  regression  scatter  plot  of  d  (resp.  0)  on  u  after 
the  ordinate  values  on  the  left  leg  have  by  has  been  assigned  negative 
signs.   Our  approach  to  this  problem  has  been  an  expedient  one:   First  re- 
index  0,d,  and  u  according  to  the  rank  of  u  (i.e.  increasing  order).   Next 
select  the  minimum  d(0)  of  d(0)  and  let  i(j)  be  the  index  of  d(0).   Then  let 


d  =  median  (d.  , ,  d.,  d.  ,) 
0  i-1    l   i+1 


©Q=  median  O.  1 ,  0.,  9.+1 ) 


It  will  be  seen  that  this  choice  is  modestly  successful 


Should  the  overall  transformation  technique  be  sufficiently  promising, 

it  will  be  necessary  to  develop  a  more  refined  method  to  estimate  d  ,  0  . 

An  obvious  first  thought  in  this  direction  would  be  to  fit  regression  lines 

to  the  two  separate  legs  and  use  the  intersection  point.   This  seems 

attractive  except  in  those  cases  for  which  either  d_  or  0_  is  naturally 
^  0     0  J 

close  to  zero.   There  some  of  the  values  in  the  scatter  plot  have  already 
been  reflected,  e.g.  observe  the  "mushy"  bottom  in  Figure  3.   This  situation 
interferes  with  our  ability  to  fit  good  lines. 


12 


Subsection  (iii)   Graphical  Method  to  Judge  the  Appropriateness  of  the 
Bivariate  Normal  Model. 

A  two  dimensional  scatter  plot  that  appears  consistent  with  a  straight 
line  regression  model  might  be  studied  further  for  distributional  properties 
to  determine  whether  the  bivariate  normal  model  is  tenable.   An  important 
and  easily  exploitable  property  of  this  model  is  that  it  can  be  tranformed 
to  a  circular  normal  random  vector  (X,Y)  and,  upon  transforming  to  polar 
coordinates 

R2  =  X2  +  Y2 
PV  =  tan"1 (Y/X) 

0  =  PV  for  X  >  0 

=  PV  +  tt        for  X  <  0  and  Y  >  0 
=  PV  -  tt        for  X  <  0  and  Y  >  0 

we  have  the  exponential  (1/2o2)  distribution  to  describe  R2  and  the  uniform 
( — it ,  it)  distribution  to  describe  0.   The  two  quantities  are  independent  and 
the  two  distributions  are  readily  verified  with  the  use  of  simple  plotting 
methods . 

It  seems  wise  to  take  a  moment  and  verify  these  properties.   The 
circular  normal  density  is 

\  1  1 

f(x.y)  -  -2^r  exp   -  -j^(x2  +  y2) 


13 


The  inverse  polar  transformation  is 


x  =  r  cos(0)   y  =  r  sin(Q)   dxdy  =  rdrde, 


Use  of  this  substitution  produces 


-r2/2o2 


(r,0))  =^re        =  gR(D^0) 


r        -r2/2o2 
where         gD(r)   =  — —  e  ,   0   <   r  <   °° 

1 

and  %^Q^    =      2tt      '  -tt   <   0   ^   ir. 


The  result  will  be  completed  when  we  make  the  change 


s  =  r2     ds  =  2rdr 


so  that 


t    ^      1    "S/2a2     n  . 
gs(s)  =  2^2-  e      ,   0  <  s  < 


Thus  (B)  is  distributed  uniformly  as  (— it, it) 


and  S  =  R2  is  exponential  (1/2o2). 
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Histograms  of  ©  and  2,n(R2)  should  exhibit  a  constant  value  behavior, 
and  a  scatter  plot  of  Qj)  on  R(or  R2)  should  not  suggest  dependence. 

The  preliminary  step  of  transforming  the  original  data,  say  (W,Z)  into 
potential  circular  normal  form  needs  to  be  described: 

First  make  the  replacements 


w  <■  (w  -  w)/S 
w 

z  «-  (z  -  z)/S 
z 


where  w  and  z  refer  to  the  original  center  of  gravity  of  the  data -and  S  and 

°  a  j  w 

S  are  the  sample  standard  deviations.   Then  compute  the  correlation 
z 

coefficient 


1     N 

P  "  m 1   £   w-  z- 

N  -  1  .  ,   li 
1=1 


where  N  is  the  number  of  data  vectors.  This  done,  the  joint  distribution  of 
W,  Z  is  viewed  as  being  a  standard  bivariate  normal  with  correlation  p.  The 
final  step  is  to  apply  the  transformation 


x  =  w 


y  =  (z  -pw)//  1  -  p: 
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It  is  easily  verified  that  (X,Y)  has  a  circular  normal  distribution  and  the 
variance  is  unity. 

The  additional  step,  taking  (X,Y)  to  (R2,©)  was  described  earlier.   In 
addition  to  (or  in  place  of)  the  histogram  plots  mentioned  earlier,  one  can 
apply  quantile  plots,  i.e.  theoretical  quantiles  plotted  against  empirical 
quantiles.   The  value  of  these  plots  lies  in  the  fact  that  they  should 
produce  reasonable  looking  straight  lines  if  the  theoretical  distributions 
are  well  chosen.   There  is  great  sensitivity  in  the  tails  (extremes)  of 
these  plots  however. 

The  empirical  quantiles  are  obtained  from  the  order,  statistics: 


%fpi)  =  °(i) 


!s(pi}  =  S(i)  =  r(i) 

p.  =  i/(N  +  1 )      for  i  =  1  ,  . .  .N 


The  corresponding  theoretical  quantiles  are  obtained  from 


C^p.)  =  ,(2p.  -  1) 
Q  (p.  )  =  -S,n(1  -  p.) 

SI  1 
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III.  Analysis  of  Application. 

The  data  sets  used  are  those  available  from  Reference  1 ;  a  Mark  10 
track  at  Nanoose  on  9  September  1982.   The  particulars  are: 


Point  No.  of 

Data  Set     Sensor  Pair     Count  Interval     Usable  points 


3933,3999  67 

4028,4097  70 

4405,4458  53 

6909,7001  93 

7079,7162  82 

7647,7738  87 


Each  of  these  six  situations  were  simulated  200  times,  and  the 
displacement  and  orientation  corrections  were  computed  each  time.   Scatter 
plots  of  0  on  d  appear  in  Figure  1  and  2.   To  get  a  reasonable  view  of  these 
data,  it  was  necessary  to  remove  some  wild  simulated  points.   Specifically: 

Data  Set  ONE    TWO   THREE    FOUR    FIVE   SIX 

Outliers  Removed    1      24     41      0      0 

The  outlier  removal  rule  was:   Drop  the  values  if  either  d  >  1 00  f t  or 
0  >  0.5  radian. 


ONE 

4,5 

TWO 

4,5 

THREE 

4,5 

FOUR 

5,6 

FIVE 

5,6 

SIX 

5,6 
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The  hairpin  shape  is  obvious  for  data  sets  one  and  two.   It  will  be 
seen  that  it  is  also  clearly  present  in  data  sets  three  and  six  as  well. 
Data  sets  four  and  five  seem  to  have  a  boundary.   No  interpretation  for  this 
is  offered  at  this  time.   The  hairpin  effect  may  be  explained  as  the  scatter 
plot  of  the  absolute  values  of  two  variables  that  are  (otherwise)  linearly 
related. 

Some  other  features  of  Figure  1  and  2  are  noted  in  passing.   Generally 
the  first  three  data  sets  have  larger  values  for  d  and  0  than  do  the  other 
three.  This  fact  could  be  related  to  one  or  both  of  the  following  facts: 
(i)   The  valid  point  count  size  is  smaller  for  the  first 
three;  67,  70,  53  vis  a  vis  93,  82,  87. 
(ii)   The  sensor  array  pair  is  different;  (4,5)  vis  a  vis 
(5,6). 
Figures  3  and  4  contain  the  scatter  plots  of  d  and  0  versus  the  first 
principal  component  of  their  respective  displacement  data  sets.   The 
efficacy  of  using  a  principal  component  to  represent  a  multivariate  data  set 
is  dependent  upon  the  percent  of  total  variability  that  is  represented  by 
that  component.   These  figures  are  recorded  below: 

Data  Set  ONE  TWO  THREE  FOUR  FIVE  SIX 

Percent   of  Total   Variance        97.0  74.6  59.3  85.5  99.4  99.1 

Total  Variance  6349.6        14564.2      275935.4  13-1         119.8        793.8 


It  is  interesting  to  note  that  the  two  "non  hairpin"  cases,  four  and  five, 
are  of  conspicuously  low  variability.   The  appearance  of  the  "v"  shape  is 
suggestive  of  the  absolute  value  of  an  otherwise  signed  variable.   Since 
data  sets  1,  2,  3,  and  6  show  this  shape,  they  are  the  ones  that  can  produce 
hairpins  in  the  scatter  plots  of  0  and  d.   Two  of  these  four  hairpins 
and  6)  have  such  narrow  arches  that  the  scatter  plots  of  the  two  legs  do  not 
separate  (see  Figures  1  and  2) .   We  note  that  the  plot  of  d  vs  u  for  Data 
Sets  four  and  five  is  suggestive  of  a  hairpin  whose  bend  point  is  not  in  the 
center  but  deep  in  the  tail.   The  corresponding  plots  of  0  vs  u  do  not 
confirm  the  effect  however. 

The  assignment  of  negative  signs  to  the  left  legs  of  these  'V*  shaped 
plots  must  be  preceded  by  the  translations. 


d  «-  d  -  d       and      0  «-  0Q 


Using  the   local  median  of    three  technique  explained  in  Section  II,    these 
values  are 

Data  Set                      ONE  TWO            THREE             FOUR  FIVE            SIX 

D                             2.20  6.07          15.10  15.19  9.74          15.19 


0 


0028    .0030    .0023    -0077     .0016     .0017 


Having  made  these  translations,  negative  signs  are  attached  to  all  values  of 
d(0)  whose  indices  (computed  from  the  ranks  of  u)  are  less  than  or  equal 
the  index  of  d,(0)  =  zero  (i.e.  formerly  d  ,(0  )). 
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This  done,  the  Figures  1  and  2  are  converted  to  their  signed  versions, 
Figures  5  and  6.   The  result  appears  to  be  reasonable  in  the  four  hairpin 
designated  cases  (Data  Sets  1,  2,  3  and  6).   A  look  at  the  center  of  the 
plots  for  Data  Sets  2  and  3  suggests  that  the  values  for  d  and  0  might  be 
better  chosen.   The  result  for  Data  Set  5  may  not  be  as  good  as  the  original 
untransf ormed  plot. 

Turning  to  the  application  of  our  ad  hoc  test  for  the  bivariate  normal 
distribution,  recall  that  the  data  are  transformed  first  to  the  circular 
normal  and  then  to  polar  form.   Figure  7  contains  the  scatter  plots  of  the 
squared  radius  vs  the  polar  angle.   It  was  hoped  that  these  could  show 
independence.   The  zero  regression  aspect  of  independence  seems  to  be 
present,  but  at  least  three  of  the  plots  (Data  Sets  1,  3,  5)  exhibit  some 
sort  of  cyclical  effect;  large  radii  are  associated  with  regularly  spaced 
polar  angles. 

Figures  8  and  9  contain  Q  -  Q  plots;   empirical  quantiles  in  the 
horizontal  and  theoretical  quantiles  in  the  vertical.   To  avoid  crowding, 
every  third  point  was  plotted.   In  order  to  support  the  distributional 
assumptions,  all  of  these  plots  should  be  reasonably  straight  lines.   It  is 
our  opinion  that  the  departures  from  straight  lines  are  sufficient  to  cast 
doubt  on  any  precise  use  of  the  bivariate  normal  distribution  for  the 
scatter  plots  in  Figures  5  and  6. 
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IV.   Summary 

The  methodology  developed  here  is  certainly  useful  but  there  are 
aspects  of  the  data  that  are  not  yet  understood.  More  and  higher  quality 
data  is  needed  in  order  to  continue  the  development.   The  main  goal  is  to 
substitute  two  dimensional  data  for  six  dimensional  data.   In  doing  so,  it 
is  hoped  that  one  can  profitably  use  popular  bivariate  probability  laws  to 
describe  these  references  distributions. 
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